Experiments in socially guided exploration: lessons learned in building robots that learn with and without human teachers

Authors

  • Andrea Lockerd Thomaz
  • Cynthia Breazeal
Abstract

We present a learning system, Socially Guided Exploration, in which a social robot learns new tasks through a combination of self-exploration and social interaction. The system’s motivational drives, along with social scaffolding from a human partner, bias behavior to create learning opportunities for a hierarchical Reinforcement Learning mechanism. The robot is able to learn on its own, but can flexibly take advantage of the guidance of a human teacher. We report the results of an experiment that analyzes what the robot learns on its own as compared to being taught by human subjects. We also analyze the video of these interactions to understand human teaching behavior and the social dynamics of the human-teacher/robot-learner system. With respect to learning performance, human guidance results in a task set that is significantly more focused and efficient at the tasks the human was trying to teach, while self-exploration results in a more diverse set. Analysis of human teaching behavior reveals insights of social coupling between the human teacher and robot learner, different teaching styles, strong consistency in the kinds and frequency of scaffolding acts across teachers, and nuances in the communicative intent behind positive and negative feedback.


Similar resources

Analyzing differences between teachers when learning object affordances via guided exploration

Our work focuses on robots deployed in human environments. These robots, which will need specialized object manipulation skills, should leverage end-users to efficiently learn the affordances of objects in their environment. This approach is promising because prior work has shown that people naturally focus on showing salient aspects of objects when providing demonstrations. In our work, we use...


Active choice of teachers, learning strategies and goals for a socially guided intrinsic motivation learner

Sao Mai Nguyen, Pierre-Yves Oudeyer (Flowers Team, INRIA and ENSTA ParisTech, France). We present an active learning architecture that allows a robot to actively learn which data collection strategy is most efficient for acquiring motor skills to achieve multiple outcomes, and generalise over its experience to achieve n...


Learning a Set of Interrelated Tasks by Using a Succession of Motor Policies for a Socially Guided Intrinsically Motivated Learner

We propose an active learning architecture capable of organizing its learning process to learn complex motor policies (which are successions of primitive motor policies) achieving multiple outcomes: Socially Guided Intrinsic Motivation at High Level (SGIM-HL). The learner can generalize over its experience to continuously learn new outcomes, by choosing actively what and how to learn guided by ...


Learning a Set of Interrelated Tasks by Using a Succession of Motor Policies for a Socially Guided Intrinsically Motivated Learner

We propose an active learning algorithmic architecture capable of organizing its learning process in order to achieve a field of complex tasks by learning sequences of primitive motor policies: Socially Guided Intrinsic Motivation with Procedure Babbling (SGIM-PB). The learner can generalize over its experience to continuously learn new outcomes, by choosing actively what and how to learn gui...



Journal:
  • Connect. Sci.

Volume 20

Publication date: 2008